459 research outputs found

    Performance analysis of a parallel, multi-node pipeline for DNA sequencing

    Get PDF
    Post-sequencing DNA analysis typically consists of read mapping followed by variant calling and is very time-consuming, even on a multi-core machine. Recently, we proposed Halvade, a parallel, multi-node implementation of a DNA sequencing pipeline according to the GATK Best Practices recommendations. The MapReduce programming model is used to distribute the workload among different workers. In this paper, we study the impact of different hardware configurations on the performance of Halvade. Benchmarks indicate that especially the lack of good multithreading capabilities in the existing tools (BWA, SAMtools, Picard, GATK) cause suboptimal scaling behavior. We demonstrate that it is possible to circumvent this bottleneck by using multiprocessing on high-memory machines rather than using multithreading. Using a 15-node cluster with 360 CPU cores in total, this results in a runtime of 1 h 31 min. Compared to a single-threaded runtime of similar to 12 days, this corresponds to an overall parallel efficiency of 53%

    Illuminating Choices for Library Prep: A Comparison of Library Preparation Methods for Whole Genome Sequencing of Cryptococcus neoformans Using Illumina HiSeq.

    Get PDF
    The industry of next-generation sequencing is constantly evolving, with novel library preparation methods and new sequencing machines being released by the major sequencing technology companies annually. The Illumina TruSeq v2 library preparation method was the most widely used kit and the market leader; however, it has now been discontinued, and in 2013 was replaced by the TruSeq Nano and TruSeq PCR-free methods, leaving a gap in knowledge regarding which is the most appropriate library preparation method to use. Here, we used isolates from the pathogenic fungi Cryptococcus neoformans var. grubii and sequenced them using the existing TruSeq DNA v2 kit (Illumina), along with two new kits: the TruSeq Nano DNA kit (Illumina) and the NEBNext Ultra DNA kit (New England Biolabs) to provide a comparison. Compared to the original TruSeq DNA v2 kit, both newer kits gave equivalent or better sequencing data, with increased coverage. When comparing the two newer kits, we found little difference in cost and workflow, with the NEBNext Ultra both slightly cheaper and faster than the TruSeq Nano. However, the quality of data generated using the TruSeq Nano DNA kit was superior due to higher coverage at regions of low GC content, and more SNPs identified. Researchers should therefore evaluate their resources and the type of application (and hence data quality) being considered when ultimately deciding on which library prep method to use

    The evolution of strong reproductive isolation between sympatric intertidal snails

    Get PDF
    The evolution of strong reproductive isolation (RI) is fundamental to the origins and maintenance of biological diversity, especially in situations where geographical distributions of taxa broadly overlap. But what is the history behind strong barriers currently acting in sympatry? Using whole-genome sequencing and single nucleotide polymorphism genotyping, we inferred (i) the evolutionary relationships, (ii) the strength of RI, and (iii) the demographic history of divergence between two broadly sympatric taxa of intertidal snail. Despite being cryptic, based on external morphology, Littorina arcana and Littorina saxatilis differ in their mode of female reproduction (egg-laying versus brooding), which may generate a strong post-zygotic barrier. We show that egg-laying and brooding snails are closely related, but genetically distinct. Genotyping of 3092 snails from three locations failed to recover any recent hybrid or backcrossed individuals, confirming that RI is strong. There was, however, evidence for a very low level of asymmetrical introgression, suggesting that isolation remains incomplete. The presence of strong, asymmetrical RI was further supported by demographic analysis of these populations. Although the taxa are currently broadly sympatric, demographic modelling suggests that they initially diverged during a short period of geographical separation involving very low gene flow. Our study suggests that some geographical separation may kick-start the evolution of strong RI, facilitating subsequent coexistence of taxa in sympatry. The strength of RI needed to achieve sympatry and the subsequent effect of sympatry on RI remain open questions. This article is part of the theme issue ‘Towards the completion of speciation: the evolution of reproductive isolation beyond the first barriers'

    Defects in the acid phosphatase ACPT cause recessive hypoplastic amelogenesis imperfecta

    Get PDF
    We identified two homozygous missense variants (c.428C>T, p.(T143M) and c.746C>T, p.(P249L)) in ACPT, the gene encoding Acid Phosphatase, Testicular, which segregate with hypoplastic Amelogenesis imperfecta (AI) in two unrelated families. ACPT is reported to play a role in odontoblast differentiation and mineralisation by supplying phosphate during dentine formation. Analysis by computerised tomography and scanning electron microscopy of a primary molar tooth from an individual homozygous for the c.746C>T variant, revealed an enamel layer that was hypoplastic but mineralised with prismatic architecture. These findings implicate variants in ACPT as a cause of early failure of amelogenesis during the secretory phase

    Spontaneous development of Epstein-Barr Virus associated human lymphomas in a prostate cancer xenograft program

    Get PDF
    Prostate cancer research is hampered by the lack of in vivo preclinical models that accurately reflect patient tumour biology and the clinical heterogeneity of human prostate cancer. To overcome these limitations we propagated and characterised a new collection of patient-derived prostate cancer xenografts. Tumour fragments from 147 unsupervised, surgical prostate samples were implanted subcutaneously into immunodeficient Rag2-/-ÎłC-/- mice within 24 hours of surgery. Histologic and molecular characterisation of xenografts was compared with patient characteristics, including androgen-deprivation therapy, and exome sequencing. Xenografts were established from 47 of 147 (32%) implanted primary prostate cancers. Only 14% passaged successfully resulting in 20 stable lines; derived from 20 independent patient samples. Surprisingly, only three of the 20 lines (15%) were confirmed as prostate cancer; one line comprised of mouse stroma, and 16 were verified as human donor-derived lymphoid neoplasms. PCR for Epstein-Barr Virus (EBV) nuclear antigen, together with exome sequencing revealed that the lymphomas were exclusively EBV-associated. Genomic analysis determined that 14 of the 16 EBV+ lines had unique monoclonal or oligoclonal immunoglobulin heavy chain gene rearrangements, confirming their B-cell origin. We conclude that the generation of xenografts from tumour fragments can commonly result in B-cell lymphoma from patients carrying latent EBV. We recommend routine screening, of primary outgrowths, for latent EBV to avoid this phenomenon

    RNAseq Analyses Identify Tumor Necrosis Factor-Mediated Inflammation as a Major Abnormality in ALS Spinal Cord

    Get PDF
    ALS is a rapidly progressive, devastating neurodegenerative illness of adults that produces disabling weakness and spasticity arising from death of lower and upper motor neurons. No meaningful therapies exist to slow ALS progression, and molecular insights into pathogenesis and progression are sorely needed. In that context, we used high-depth, next generation RNA sequencing (RNAseq, Illumina) to define gene network abnormalities in RNA samples depleted of rRNA and isolated from cervical spinal cord sections of 7 ALS and 8 CTL samples. We aligned \u3e50 million 2X150 bp paired-end sequences/sample to the hg19 human genome and applied three different algorithms (Cuffdiff2, DEseq2, EdgeR) for identification of differentially expressed genes (DEG’s). Ingenuity Pathways Analysis (IPA) and Weighted Gene Co-expression Network Analysis (WGCNA) identified inflammatory processes as significantly elevated in our ALS samples, with tumor necrosis factor (TNF) found to be a major pathway regulator (IPA) and TNFα-induced protein 2 (TNFAIP2) as a major network “hub” gene (WGCNA). Using the oPOSSUM algorithm, we analyzed transcription factors (TF) controlling expression of the nine DEG/hub genes in the ALS samples and identified TF’s involved in inflammation (NFkB, REL, NFkB1) and macrophage function (NR1H2::RXRA heterodimer). Transient expression in human iPSC-derived motor neurons of TNFAIP2 (also a DEG identified by all three algorithms) reduced cell viability and induced caspase 3/7 activation. Using high-density RNAseq, multiple algorithms for DEG identification, and an unsupervised gene co-expression network approach, we identified significant elevation of inflammatory processes in ALS spinal cord with TNF as a major regulatory molecule. Overexpression of the DEG TNFAIP2 in human motor neurons, the population most vulnerable to die in ALS, increased cell death and caspase 3/7 activation. We propose that therapies targeted to reduce inflammatory TNFα signaling may be helpful in ALS patients
    • 

    corecore